Run-time Adaptation for Multi-Core Execution
ثبت نشده
چکیده
Contention over shared resources is one of many variables affecting parallel performance on modern multi-core processors. The issue is of particular interest because it the primary factor limiting fully independent threads/processes which would otherwise showcase linear speedup with each additional core. Furthermore, addressing the problem is a non-trivial task. Shared-resource contention, while fundamentally a product of architectural design decisions, is significantly influenced by the running workload as well as code-generation decisions made at compile time. This paper investigates the effects of workload selection and code-generation on parallel performance and showcases that there is no single factor or characteristic that can be leveraged to provide a best static compilation or workload selection with respect to parallel performance. Addressing resource contention must be handled in an adaptive manner at runtime. To this end, this paper proposes a hybrid runtime system that uses hardware performance counters to influence runtime code selection. The system is evaluated using a subset of the SPEC CPU2006 benchmark suite and is demonstrated across variety of x86 multi-core architectures.
منابع مشابه
Efficient parallelization of the genetic algorithm solution of traveling salesman problem on multi-core and many-core systems
Efficient parallelization of genetic algorithms (GAs) on state-of-the-art multi-threading or many-threading platforms is a challenge due to the difficulty of schedulation of hardware resources regarding the concurrency of threads. In this paper, for resolving the problem, a novel method is proposed, which parallelizes the GA by designing three concurrent kernels, each of which running some depe...
متن کاملAn integrated simulation-DEA approach to multi-criteria ranking of scenarios for execution of operations in a construction project
The purpose of this study is to examine different scenarios for implementing operations in the pre-construction phase of a project, based on several competing criteria with different importance levels in order to achieve a more efficient execution plan. This paper presents a new framework that integrates discrete event simulation (DES) and data envelopment analysis (DEA) to rank different scena...
متن کاملInter-cluster Thread-to-core Mapping and DVFS on Heterogeneous Multi-cores
Heterogeneous multi-core platforms that contain different types of cores, organized as clusters, are emerging, e.g. ARM’s big.LITTLE architecture. These platforms often need to deal with multiple applications, having different performance requirements, executing concurrently. This leads to generation of varying and mixed workloads (e.g. compute and memory intensive) due to resource sharing. Run...
متن کاملWeighted Execution Time Analysis of Applications on COTS Multi-core Architectures Weighted Execution Time Analysis of Applications on COTS Multi-core Architectures
Commercial off-the-shelf multi-core architectures could significantly reduce costs and timeto-market of hard real-time systems. However, due to the unpredictable interference on the shared memory, the worst case execution time is either non-deterministic or overly pessimistic. Typically, the pessimism originates from the conservative assumption of maximum interference for each memory access. Th...
متن کاملA Lightweight Run-time Support for Fast Dense Linear Algebra on Multi-core
The work proposes ffMDF, a lightweight dynamic run-time support able to achieve high performance in the execution of dense linear algebra kernels on shared-cache multi-core. ffMDF implements a dynamic macro-dataflow interpreter processing DAG graphs generated on-the-fly out of standard numeric kernel code. The experimental results demonstrate that the performance obtained using ffMDF on both fi...
متن کامل